Detection of Glottal Closing and Opening Instants Using an Improved Dypsa Framework

نویسندگان

  • Mark R. P. Thomas
  • Jon Gudnason
  • Patrick A. Naylor
چکیده

Accurate estimation of glottal closure instants (GCIs) and opening instants (GOIs) is important for speech processing applications that benefit from glottal-synchronous processing. This paper proposes a novel improvement to the DYPSA framework, based upon a multiscale analysis technique and an accurate estimation of glottal volume velocity. This replaces the linear prediction residual for candidate selection and enables the reliable detection of both GCI and GOI candidates. A two-stage dynamic programming process then detects the GCIs and removes them from the candidate set, before detecting GOIs from the remaining candidates. A postprocessing step improves GOI detection using the estimated GCIs. Evaluation against hand-labelled data on a large speech database shows that GCI detection is marginally improved compared with original DYPSA at 96% but, more importantly, shows that GOI detection can be achieved to a similar accuracy of 95%.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Glottal closure and opening instant detection from speech signals

This paper proposes a new procedure to detect Glottal Closure and Opening Instants (GCIs and GOIs) directly from speech waveforms. The procedure is divided into two successive steps. First a mean-based signal is computed, and intervals where speech events are expected to occur are extracted from it. Secondly, at each interval a precise position of the speech event is assigned by locating a disc...

متن کامل

Voice source cepstrum processing for speaker identification

Voice source analysis and modelling has played a key role in important speech applications such as speech recognition, speech synthesis and speaker recognition. This work presents a robust algorithm for glottal closure detection and a novel set of voice source features for speaker recognition. In the rst part of the dissertation the DYPSA algorithm is developed for detecting glottal closure ins...

متن کامل

On the use of the derivative of electroglottographic signals for characterization of nonpathological phonation.

Electroglottography is a common method for providing noninvasive measurements of glottal activity. The derivative of the electroglottographic signal, however, has not attracted much attention, although it yields reliable indicators of glottal closing instants. The purpose of this paper is to provide a guide to the usefulness of this signal. The main features that are to be found in this signal ...

متن کامل

Local regularity analysis at glottal opening and closure instants in electroglottogram signal using wavelet transform modulus maxima

This paper deals with singularities characterisation and detection in Electroglottogram (EGG) signal using wavelet transform modulus maxima. These singularities correspond to glottal opening and closure instants (GOIs and GCIs), Wavelets with one and two vanishing moments are applied to EGG signal. We show that wavelet with one vanishing moment is sufficient to detect singularities of EGG signa...

متن کامل

Detection of glottal opening instants using Hilbert envelope

The objective of this work is to develop an automatic method for estimating glottal opening instants (GOIs) using Hilbert envelope (HE). The GOIs are secondary major excitations after glottal closure instants (GCIs) during the production of voiced speech. The HE is defined as the magnitude of complex time function (CTF) of a given signal. The unipolar property of HE is exploited for picking the...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009